Covariance, correlation matrix and the multi-scale community structure of networks 
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Empirical studies show that real world networks often exhibit multiple scales of topological de- 
scriptions. However, it is still an open problem how to identify the intrinsic multiple scales of 
networks. In this article, we consider detecting the multi-scale community structure of network 
from the perspective of dimension reduction. According to this perspective, a covariance matrix 
of network is defined to uncover the multi-scale community structure through the translation and 
rotation transformations. It is proved that the covariance matrix is the unbiased version of the well- 
known modularity matrix. We then point out that the translation and rotation transformations fail 
to deal with the heterogeneous network, which is very common in nature and society. To address 
this problem, a correlation matrix is proposed through introducing the rescaling transformation 
into the covariance matrix. Extensive tests on real world and artificial networks demonstrate that 
the correlation matrix significantly outperforms the covariance matrix, identically the modularity 
matrix, as regards identifying the multi-scale community structure of network. This work provides a 
novel perspective to the identification of community structure and thus various dimension reduction 
methods might be used for the identification of community structure. Through introducing the 
correlation matrix, we further conclude that the rescaling transformation is crucial to identify the 
multi-scale community structure of network, as well as the translation and rotation transformations. 

PACS numbers: 89.75.Fb, 05.10.-a 



I. INTRODUCTION 

Many real world complex networks, including social 
networks [l|, y], information networks 0, l1 , and biolog- 
ical networks [l|, Q, are found to divide naturally into 
communities, known as groups of nodes with a higher- 
than-average density of edges connecting them. Com- 
munities are of interest because they often correspond 
to functional units such as collections of web pages on a 
single topic. The identification of community structure 
has attracted much attention in various scientific fields. 
Many methods have been proposed and applied success- 
fully to some specific complex networks [6l-ll6l|. For re- 
views, the reader can refer to [17j . 

The community structure of real world networks of- 
ten exhibit multiple scales [181421] . However, the multi- 
scale community structure cannot be uncovered directly 
through traditional methods. Arenas et al [22J ] pointed 
out that the synchronization process reveals topological 
scales of networks and that the spectrum of the Laplacian 
matrix can be used to identify such topological scales. 
Cheng and Shen [23| proposed the network conductance 
to identify the multiple topological scales through inves- 
tigating the diffusion process taking place on networks. 
Recently, in [2J], a general framework is proposed for 
the detection of community structure in multi-scale net- 
works. 

Generally, the straightforward description of network 
topology is a high-dimensional but redundant descrip- 
tion, where each node is taken as one dimension of the 
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network and the edges characterize the relationship be- 
tween these dimensions. The identification of community 
structure can be viewed as finding the most significant re- 
duced dimensions which capture the main characteristics 
of network topology 25] . Different significance levels for 
such dimensions correspond to the community structure 
with different topological scales. 



In this article, we first apply the principal component 
analysis (PC A) to characterize the community structure. 
We show that the covariance matrix behind the PCA is 
the unbiased version of the modularity matrix |8] , which 
underlies the well-known benefit function modularity for 
community detection. From the perspective of dimen- 
sion reduction, however, the covariance matrix only takes 
into account the translation and rotation transforma- 
tions. These two transformations fail to deal with the 
heterogeneous distribution of node degree and commu- 
nity size. To address this problem, a correlation matrix 
is proposed through introducing the rescaling transfor- 
mation into the covariance matrix. Theoretical analysis 
indicates that all these three transformations are cru- 
cial to identify the multi-scale community structure. Fi- 
nally, the effectiveness of the correlation matrix is demon- 
strated through the comparison with the covariance ma- 
trix or the modularity matrix on real world networks and 
artificial benchmark networks. Extensive tests demon- 
strate that the correlation matrix is very effective at un- 
covering the multi-scale community structure of network 
and it significantly outperforms the modularity matrix or 
the covariance matrix, especially when the distribution 
of node degree are heavily heterogenous. The dimension 
reduction perspective opens the door to utilizing various 
dimension reduction techniques for the identification of 
community structure. 



II. COVARIANCE MATRIX OF NETWORK 

A directed network is often described by its adjacency 
matrix A whose element A^ being 1 if there exists an 
edge pointing to node j from node i, and otherwise. 
The node i is called the tail of the edge and the node j 
is called the head of the edge. The out-degree of a node 
is defined as k° ut = J2j ^-ij an d the in-degree is k™ = 
J2i^ij- We suppose that the network has n nodes and 
to edges with to = J^i ^f"* = S 7 ^T- For an undirected 
network, it can be transformed into a directed one by 
replacing each edge with two oppositely directed edges. 
Note that, for a self-loop edge, only one directed edge is 
used to replace the original undirected one. 

Another representation for network is given by two 
node-edge incidence matrices with the size n by to, which 
are respectively defined as 




1 if the node i is the tail of the edge I, 
otherwise, 



1 if the node i is the head of the edge I, 
otherwise. 



(1) 



(2) 



Note that the rows of X (or Y) are mutually orthogonal 
and that the columns each sum to unity. 

As to this representation, n nodes correspond to n di- 
mensions with the ith dimension being denoted by ei, 
whose ith element is 1 and other elements are 0. The 
columns of X or Y can be taken as n-dimensional data 
points distributed in the space spanned by the n di- 
mensions. The mean of these data points in X is de- 
noted by x — £i^2jX*j t where AV,- is the jth col- 
umn of X. Similarly, we give the mean of the data 
points in Y as y = ^jj JZj i*.r According to Eq. (fTJ) 
and Eq. ©, we have x = (kf ut , k% ut , ■■■ , k° n ut ) T /to and 
y = (kf 1 , k % 2 , ■ ■ ■ ,k™) T '/ 77i. Now we subtract off the 
mean x from each column of X and the mean y from each 
column of Y . Such an operation is known as the trans- 
lation transformation and makes the data points mean 
zero. The resulting matrices are denoted by X = X — xt T 
and Y = Y — yt T , where 1 is the m-dimensional vector 
with all its elements being 1. With X and Y, the covari- 
ance between the ith row of X and the jth row of Y can 
be calculated by A« • {Yj*) T /(m — I). Here, the denom- 
inator 77i — 1 is used instead of 777 to make the covariance 
be unbiased. In this way, all these covariances between 
the rows of X and the rows of Y form an n x 77 matrix 



C 



1 



777 — 1 



-XY 1 



(3) 



This matrix is referred to as the covariance matrix of 
network. Its elements arc 



Cij — 



1 



A, 



Uoutuin 



Note that the covariance matrix can be easily extended to 
weighted networks if we consider each weighted edge be- 
tween two nodes as multiple unweighted edges connecting 
them. 

To our surprise, the covariance matrix is identical to 
the modularity matrix except that the denominator in 
the first term is 777 — 1 in the covariance matrix while it 
is 777 in the modularity matrix. In statistics, when cal- 
culating the empirical variance from sample data points 
rather than the distribution itself, to — 1 is used instead of 
777 to make the empirical variance be unbiased. Thus the 
covariance matrix could be taken as the unbiased version 
of the modularity matrix. 



III. SPECTRUM OF THE COVARIANCE 
MATRIX AND COMMUNITY STRUCTURE 

Matrix spectral analysis provides an important tech- 
nique for network division and the identification of com- 
munity structure. For example, as to the Laplacian ma- 
trix, the Fielder's eigenvector [26| is well studied and 
widely used for two-way network partition. Newman pro- 
posed to find the community structure of network using 
the eigenvectors of the modularity matrix [7] . This sec- 
tion aims to illustrate what is told about the community 
structure by the covariance matrix from the perspective 
of dimension reduction. 

It is well known that the traditional covariance matrix 
plays an important role in PCA. By analyzing its eigen- 
values and eigenvectors, the most significant directions 
(or dimensions) in terms of the variance are identified 
as principal components. The lesser significant ones are 
discarded to reduce the number of dimensions and to 
alleviate redundance without losing too much relevant 
information. In this article, the idea similar to PCA is 
adopted to analyze the role of the covariance matrix of 
network at uncovering the community structure of net- 
work. 

Intuitively, as to networks with community structure, 
the tail nodes of edges are expected to be positively cor- 
related with the head nodes of edges (27], |28| . According 
to the definition of covariance, the more two variables 
correlate with each other, the larger the covariance be- 
tween them. Now the task is to find a direction along 
which the covariance between X and Y reaches the max- 
imum. Without loss of generality, we use a normalized 
vector v to denote such a direction. We write v as a lin- 
ear combination of the normalized eigenvectors Vi of the 
covariance matrix C, i.e., v = J^ a^, where the coeffi- 
cients a,i — vj v. Since that v is a normalized vector, we 
have v T v = 1 and this implies that 



£«? 



i. 



(5) 



(4) 



Due to that X and Y have zero means, the covariance 



between them along the direction v can be calculated by 



V = 



771—1 



■j T Cv 



(v'X^v'Y) 1 = 



1 



(v'X^Y'v) 



(£■ 



(> ",'•, 



^CC^djVj) 



ij 



i j j K 



E a ^ Ai 



(6) 



where A^ is the eigenvalue of C corresponding to the 



eigenvector Vi and we have made use of 



= Si 



Without loss of generality, we assume that the eigenval- 
ues are labeled in decreasing order Ai > A 2 > • • • > A ?! Q- 
The task of maximizing V can then be equated with the 
task of choosing the nonnegative quantities af so as to 
place as much as possible of the weight in the sum © 
in the terms corresponding to the most positive eigenval- 
ues, and as little as possible in the terms corresponding 
to the most negative ones, while respecting the normal- 
ization constraint in Eq. (JS|). 

Obviously, V reaches maximum when we set a\ = 1 
and a} = 0(i ^ 1), i.e., the desired direction v is parallel 
to the eigenvector v\. Then, we turn to the next direc- 
tion along which the covariance is maximized with the 
constraint that it is orthogonal to the obtained direction 
v\. According to Eq. ([5]) and Eq. ([6]), such a direction 
is parallel to the eigenvector vi. In the similar way, it 
can be easily shown that all the eigenvectors of the co- 
variance matrix C form a set of orthogonal bases for the 
n-dimensional space, with these dimensions being ranked 
in terms of the covariance between X and Y along them. 

In the following, we will show that the community 
structure of network is reflected by eigenvectors of the co- 
variance matrix of network. Specifically, the eigenvectors 
corresponding to positive eigenvalues make positive con- 
tribution to reflect the community structure of network, 
and the negative ones provide information for the anti- 
community structure of network. This phenomenon has 
been noticed by Newman when studying the spectrum 
of the modularity matrix [8] . Different from Newman's 
work, we illustrate the relation between the covariance 
matrix and the community structure from the perspec- 
tive of dimension reduction. 

We use U to denote the eigenvector matrix with its 
columns being the normalized eigenvectors of the covari- 
ance matrix C ranked in the descending order of the cor- 
responding eigenvalues, which are placed on the diagonal 
of the diagonal matrix D. Then we have 



D = U T CU 
1 



771—1 

1 
771 — 1 



U T (XY T )U 



Tv\T 



(U r X)(U T Y) 



(7) 



1 In the case that the eigenvalues are complex numbers, the eigen- 
values are ranked in the decreasing order of the real part of these 
complex numbers. 




FIG. 1. (Color online) The network of the karate club stud- 
ied by Zachary ,29]] . The real social fission of this network is 
represented by two different shapes, circle and square. Dif- 
ferent colors depict the communities at the other scale. The 
dashed-curve gives another alternative partition of network. 



Multiplying a vector by U T from the left-hand side 
means reexpressing it with respect to the new orthogonal 
bases comprising of the columns of U. Thus, X — U T X 
and Y — U T Y are respectively the new coordinates of 
the original sample data X and Y with respect to the 
new bases. Note that X and Y are respectively the re- 
sults of rotating X and Y around the coordinate origin. 
Thus, X and Y both have zero mean and accordingly the 
matrix D is the covariance matrix for X and Y . All the 
off-diagonal elements of D are zeroes, indicating that X 
only correlates to Y along the same dimension and they 
are linearly independent along different dimensions. If 
X and Y are positively correlated along a certain dimen- 
sion, the corresponding diagonal element of D is positive, 
otherwise negative. Furthermore, the magnitude of the 
eigenvalues provides an indicative index for the signifi- 
cance of the dimensions to reflect the community struc- 
ture of the underlying network. Hereafter, we will neglect 
the constant multiplier l/(m — 1) in the covariance ma- 
trix without losing any information for the detection of 
the community structure. 

To illustrate what is provided about the community 
structure of network by the covariance matrix, as an ex- 
ample, we analyze the covariance matrix of the Zachary's 
karate club network [35| , which is widely used as a bench- 
mark for the methods of community detection. The net- 
work and its community structure are depicted in Fig. [T] 
Table U shows the eigenvectors corresponding to the most 
positive and negative eigenvalues of the covariance ma- 
trix. We can see that the real fission of the club network 
is exactly revealed by the signs of the components in 
the eigenvector corresponding to the most positive eigen- 
value. As to the eigenvector corresponding to the most 
negative one, its components divide the nodes into two 
groups with many connections lying between them, re- 
flecting the so-called anticommunity structure. 

Besides the eigenvector corresponding to the most pos- 
itive eigenvalue, other eigenvectors corresponding to pos- 



TABLE I. Eigenvectors corresponding to the most positive 
and negative eigenvalues of the covariance matrix of Zachary's 
club network 



node 


positive 


negative 


node 


positive 


negative 


1 


-0.3875 


0.4576 


18 


-0.1320 


-0.0720 


2 


-0.2696 


0.1957 


19 


0.1394 


-0.1259 


3 


-0.1319 


0.1371 


20 


-0.0576 


-0.1605 


4 


-0.2535 


0.0376 


21 


0.1394 


-0.1259 


5 


-0.1340 


-0.0127 


22 


-0.1320 


-0.0720 


6 


-0.1457 


0.0018 


23 


0.1394 


-0.1259 


7 


-0.1457 


0.0018 


24 


0.2167 


-0.0540 


8 


-0.2094 


-0.0585 


25 


0.0563 


0.0862 


9 


0.0545 


-0.1490 


26 


0.0754 


0.0867 


in 


0.0479 


-0.0906 


27 


0.1158 


-0.0551 


n 


-0.1340 


-0.0127 


28 


0.1028 


-0.0515 


12 


-0.0778 


-0.0594 


29 


0.0683 


-0.0429 


13 


-0.1287 


-0.0438 


30 


0.2063 


-0.0616 


14 


-0.1350 


-0.1469 


31 


0.0963 


-0.0894 


15 


0.1394 


-0.1259 


32 


0.1019 


-0.1414 


16 


0.1394 


-0.1259 


33 


0.3239 


0.3346 


17 


-0.0585 


0.0442 


34 


0.3698 


0.6200 



itive eigenvalues can also be utilized to reveal the commu- 
nity structure of network. Specifically, the components 
of the first p eigenvectors are used to represent the nodes 
of network into p-dimensional vectors and the commu- 
nity structure is then uncovered through clustering the 
node vectors using, for example, the fc-means clustering 
method. We have known that the original description 
of network topology, i.e., Eq. (JT]) and Eq. ([2]), actually 
uses a set of standard bases. With respect to this set of 
bases, the rth node of network is represented with the 
coordinate vector a whose elements are both but the 
ith one being 1 . The columns of the normalized eigenvec- 
tor matrix U constitute another set of orthogonal bases, 
which can be obtained from the standard bases through 
the rotation transformation, i.e., multiplying the stan- 
dard bases with U T from the left-hand side. With re- 
spect to this new bases, the coordinate vector of the «th 
node can be represented by U T ei, which is the projection 
of the original coordinate vector on the new set of bases. 
Mathematically, the ith projected node vector r^ can be 
denoted by 

Note that the covariances along different new axis direc- 
tion U*j between the data X and Y vary, characterized 
by 

C = UDU T = {UD 1/2 )(UD 1/2 ) T . 

Thus, for the purpose of clustering the node vectors, it is 
more appropriate to represent the ith node with a node 
vector ri with its jth component being 

(8) 



XjUij. 

For negative eigenvalues, the corresponding components 
of the node vector are complex numbers. 



Using Eq. ((5J, we can partition the nodes of network 
into communities through clustering the node vectors r 4 
using the fc-means clustering method. However, only the 
eigenvectors corresponding to positive eigenvalues make 
positive contribution to the community structure. Fur- 
thermore, the eigenvectors corresponding to the small 
positive eigenvalues are less significant to uncover the 
community structure of network. This poses a challeng- 
ing problem, i.e., the choice of the number of eigenvec- 
tors. 

Intuitively, as for the goal of identifying the community 
structure, all the eigenvectors corresponding to positive 
eigenvalues provide certain relevant information. This 
means that the number of positive eigenvalues gives an 
upper bound on the number of eigenvectors used. As 
a kind of mesoscopic structure of network, however, the 
community structure provides a coarse-grained descrip- 
tion of the network topology. Only the most significant 
structural features are maintained and the less ones are 
neglected. Now the tricky problem is how to determine 
the significant ones. In this article, this problem is equiv- 
alent to the choice of the significant eigenvectors. We 
have known that the magnitudes of eigenvalues charac- 
terize the covariance between the data X and Y along 
the direction of the corresponding eigenvectors. Thus 
we can choose the number of significant eigenvectors ac- 
cording to the magnitude of eigenvalues. Intuitively, the 
eigenvectors corresponding to larger positive eigenvalues 
are desired. Furthermore, a large eigengap, i.e., interval 
between two successive eigenvalues, provides an effective 
indicator to determine the appropriate number of signif- 
icant eigenvectors. Similar methods have been adopted 
in other contexts taking the advantage of the eigengap 
of many other types of matrix, inclu ding the adjacency 
matrix l3Cfl , the transition matrix [3ll . 1321 ] , the Laplacian 
matrix [22| , the modularity matrix [8| and the normalized 
Laplacian matrix [23|]. More importantly, the choice of 
eigenvectors with different significance levels determines 
the different scale of community structure. The existence 
of a significant scale is indicated by the occurrence of a 
larger eigengap in the spectrum of the covariance matrix. 

Another important problem is the choice of the number 
of communities. According to Eq. ((SJ, we know that each 
node of network can be represented by a p-dimensional 
node vector through projecting its standard coordinate 
vector on the directions of the p significant eigenvec- 
tors. Then the identification of community structure be- 
comes a problem of partitioning the node vectors into 
groups. According to [8(, the number of communities is 
one greater than the number of orthogonal dimensions to 
represent these vectors, i.e., p+1 is the appropriate num- 
ber of community when the top p eigenvector is adopted 
to obtain the projected node vectors. 

Taking the Zachary's karate club network as example 
again, we illustrate the effectiveness of the eigengap at de- 
termining the number of significant eigenvectors and ac- 
cordingly the number of communities. As shown in Fig. [5] 
(left panel), the largest eigengap (only the ones between 
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FIG. 2. The spectrum of the covariance matrix. Left panel: 
the Zachary's karate club network; Right panel: the H13-4 
network. 



positive eigenvalues are considered) occurs between the 
first and second largest eigenvalues. It indicates that it is 
appropriate to choose only the first eigenvector and the 
number of communities is 2. Furthermore, the resulting 
two communities exactly reflect the real split of the net- 
work in Fig. fT] In addition, besides the largest eigengap, 
two other relative larger eigengaps can be observed, one 
being between the second and the third eigenvalues, and 
the other being between the third and the forth eigen- 
values. The resulting partition according to these two 
eigengaps are also depicted in Fig. [1] one dividing the 
network into three communities separated using dashed 
curves, and the other dividing the network into four com- 
munities differentiated by colors. These two partitions 
are often the results of many community detection meth- 
ods. Although they are not identical to the real split 
of the network, they reveal certain relevant topological 
feature of the network at alternative scales. 

Actually, as to a network with multi-scale community 
structure, each scale corresponds to a large eigengap in 
the spectrum of the covariance matrix. Thus we can 
identify the multi-scale community structure using the 
top eigenvectors indicated by these eigengaps. As an ex- 
ample, we illustrate the identification of the multi-scale 
community structure of the HI 3-4 network, which is con- 
structed according to [22|]. The network has two prede- 
fined hierarchical levels. The first hierarchical level con- 
sists of 4 groups of 64 nodes and the second hierarchical 
level consists of 16 groups of 16 nodes. On average, each 
node has 13 edges connecting to the nodes in the same 
group at the second hierarchical level and has 4 edges 
connecting to the nodes in the same group at the first 
hierarchical level. This explains the name of such kind of 
networks. In addition, the average degree of each node 
is 18. According to the construction rules of the H13-4 
network, the two hierarchical levels constitute the differ- 
ent topological description of the community structure 
of the H13-4 network at different scales. As shown in 
Fig. [2] (right panel), the largest eigengap occurs between 
the 15th and 16th largest eigenvalues, indicating that the 
community structure at the most significant scale corre- 
sponds to the partition dividing the nodes into 16 groups. 



FIG. 3. (Color online) The clique circle network as a 
schematic example. Each circle corresponds to a clique, whose 
size is marked by its label. The cliques labeled with k s are 
smaller cliques with the size s, while the cliques labeled with 
kb are bigger cliques with the size b. In this article, s = 10 
and b = 20. 



Actually, the resulting communities are exactly the pre- 
defined 16 groups of 16 nodes in the second hierarchical 
level. In addition, the second largest eigengap occurs be- 
tween the third eigenvalue and the forth one, indicating 
that the second significant topological scale corresponds 
to the partition dividing the network nodes into 4 groups. 
Again the resulting communities are exactly the prede- 
fined 4 groups of 64 nodes in the first hierarchical level. 



IV. FINDING THE MULTI-SCALE 

COMMUNITY STRUCTURE USING THE 

CORRELATION MATRIX 

The previous section shows that the spectrum of the 
covariance matrix provides an promising way to reveal 
the multi-scale community structure of network. Note 
that, as to the example H13-4 network, the nodes have 
approximately the same degree and the communities at 
a specific scale are of the same size. However, the real 
world networks usually have heterogenous distributions 
of node degree and community size. Thus it will be more 
convincing to test the covariance matrix on networks with 
heterogenous distributions of node degree and commu- 
nity size. 

Before we give such a test in the subsequent section, 
using a schematic network, we first illustrate the ineffec- 
tiveness of the covariance matrix to deal with the het- 
erogeneous distributions of node degree and community 
size. The schematic network is often called the clique cir- 
cle network as depicted in Fig. [3] Generally speaking, the 
intrinsic community structure corresponds to the parti- 
tion where each clique is taken as a community, that is, 
only one intrinsic scale exists in this network. However, 
as shown in Fig. 2] (left panel), two scales are observed 
when we investigate the community structure of this net- 
work using the spectrum of the covariance matrix. One 
scale corresponds the intrinsic scale of the network, and 
the other corresponds to dividing the network nodes into 
5 groups, which is not desired. 

To address this problem, we reconsider the formulation 
of the covariance matrix in Section [TT] and the spectral 
analysis on the covariance matrix in Section IIIII When 
formulating the covariance matrix from the data defined 
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FIG. 4. The spectrum of the covariance matrix (left panel) 
and the correlation matrix (right panel) corresponding to the 
clique circle network depicted in Fig. [3] The horizontal axis 
shows the eigenvalues of matrix and the vertical axis repre- 
sents the rank index of eigenvalues in descending order. 



in Eq. ([lj and Eq. ([2]), the mean is subtracted off to 
place the data around the coordinate origin. This is the 
translation transformation. When using the eigenvectors 
of the covariance matrix as the new orthogonal bases in- 
stead of the standard bases, the rotation transformation 
is adopted. These two transformations, however, both 
do not take into account the difference amongst the vari- 
ances of the original dimensions each corresponding to a 
node. Thus the spectrum of the covariance matrix fails 
to deal with the heterogeneous distribution of node de- 
gree and community size. Fortunately, the possible rem- 
edy is also right here, i.e., the rescaling transformation. 
Through introducing the rescaling transformation into 
the covariance matrix, we obtain a correlation matrix, 
which can be formulated as 



R = (Z x )- 1 C(X Y )-\ 



(9) 



where Sx is a diagonal matrix with its diagonal elements 
(£ x )ii = \A° U *(1 - k° ut /m)/(m - 1) being the empir- 
ical variance of the data X along the ith standard axis 
direction and Ey is a diagonal matrix with its diagonal 

elements (E y )„- = Jkf{l - kf/m)/(m - 1) being the 

empirical variance of the data Y along the jth standard 
axis direction. Specifically, the elements of R are defined 
as 



Rij - 



A; 



k° ut k\ 



yjk° ut (l - k° ut /m)Jk™{l - kf/m) 



(10) 



Compared with the covariance matrix, the correlation 
matrix has two advantages at identifying the multi-scale 
community structure. On one hand, the correlation ma- 
trix can well deal with the heterogeneous distribution of 
node degree and community size. As shown in Fig. |4] 
(right panel) , the intrinsic scale of the clique circle net- 
work is correctly revealed by the spectrum of the cor- 
relation matrix. On the other hand, the eigenvalues of 
the correlation matrix themselves can provide intuitive 
judgements for the cohesiveness within communities and 



the looseness of connections among different communi- 
ties. As shown in Fig.HJ the eigenvalues lying the greater 
side of the largest eigengap all approach I. This indi- 
cates that the intrinsic communities are very cohesive. 
Meanwhile, other eigenvalues are very small, indicating 
the loose connectivity among these communities. Specif- 
ically, through introducing the rescaling transformation 
into the covariance matrix, the eigenvalues of the cor- 
relation matrix are rescaled. Thus, besides the eigen- 
gaps among successive eigenvalues, the eigenvalues them- 
selves also provide indicative information of the commu- 
nity structure of network. This is especially important 
for the networks without significant topological scale. For 
these networks, the eigengaps of the covariance matrix or 
the correlation matrix both fail to provide obvious evi- 
dence for the number of intrinsic communities. Without 
the rescaling transformation, the magnitude of the co- 
variance matrix is influenced by the network size and 
the distribution of node degree. Thus the magnitude it- 
self cannot provide useful information to determine the 
cohesiveness of the detected communities. As to the cor- 
relation matrix, however, the magnitude of the eigenval- 
ues of the correlation matrix has been rescaled and thus 
can provide intuitive knowledge about the cohesiveness 
of communities and then can help us choose the desired 
scale with respect to specific application demands. More 
importantly, the eigenvalues of the correlation matrix can 
be compared among different networks due to that they 
are rescaled and thus not influenced by the network size. 
In the subsequent section, the effectiveness of the corre- 
lation matrix will be further demonstrated through ex- 
tensive tests on artificial benchmark networks and real 
world networks. 

Now we clarify the implication of the correlation ma- 
trix to the widely-used benefit function modularity for 
network partition. As shown in [7|, |8[, the modularity 
matrix (the biased version of the covariance matrix used 
in this article) underlies the benefit function modularity 
and the optimization of the modularity can be carried out 
using the eigenvectors of the modularity matrix. In this 
article, we propose to uncover the community structure 
using the eigenvectors of the correlation matrix, which is 
obtained through introducing the rescaling transforma- 
tion into the covariance matrix. In this way, the benefit 
function for network partition actually changes. Specifi- 
cally, the correlation matrix provides us another benefit 
function for network partition. This benefit function can 
be formulated as 



F 



£E* 



icQjc-K'ij i 



(ii) 



where c denotes a community, and 8i C = I if node i be- 
longs to the community c and otherwise. On one hand, 
the benefit function F can be directly optimized to un- 
cover the community structure similar to the role of the 
modularity. On the other hand, the rationale behind F 
and the modularity are somewhat different. It is well 
known that the success of the modularity is partly from 



its considering the difference between the actual weight 
of the edge connecting node i and j and the expected 
weight of the edge by chance characterized usually us- 
ing a null model, e.g. the configuration model. In this 
way, the modularity takes into account the effects from 
the heterogeneous node degrees. However, the difference 
of the variance associated with each node dimension is 
not taken into account in the definition of modularity. 
Thus the modularity may be blind to smaller communi- 
ties when the network has heterogeneous distribution of 
node degree and community size. This problem roots in 
the heterogeneity of network rather than the existence 
of an intrinsic scale (usually depends on the size of net- 
work) causing the resolution limit problem |18| | of modu- 
larity in the definition of modularity. This heterogeneity- 
caused problem can be well dealt with by the new benefit 
function F since that the underlying correlation matrix 
takes into account the rescaling transformation besides 
the translation transformation and the rotation transfor- 
mation utilized in the the definition of modularity. In 
summary, it is inadequate to only consider the heteroge- 
neous node degree through using a reference null model 
as done by the modularity. A rescaling transformation is 
required to combat such a drawback facing the modular- 
ity. 
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FIG. 5. (Color online) Comparison between the correlation 
matrix and the covariance matrix in terms of their effective- 
ness at identifying the number of communities on benchmark 
networks with different parameter configurations. For each 
parameter configuration, 100 generated networks are used. 



V. EXPERIMENTAL RESULTS 

In this section, we empirically demonstrate the effec- 
tiveness of the multi-scale community detection methods 
based on the spectrum of the covariance matrix and the 
correlation matrix through tests on the artificial bench- 
mark networks. In addition, we apply the correlation 
matrix method on a variety of real world networks. 



A. Tests on artificial benchmark networks 

We utilize the benchmark proposed by Lancichinetti et 
al in [33(| . This benchmark provides networks with het- 
erogeneous distributions of node degree and community 
size. Thus it poses a much more severe test to community 
detection algorithms than standard benchmarks. Many 
parameters are used to control the generated networks 
in this benchmark: the number of nodes N , the average 
node degree (k), the maximum node degree max_fc, the 
mixing ratio fj,, the exponent 7 of the power law distri- 
bution of node degree, the exponent /3 of the power law 
distribution of community size, the minimum community 
size min_c, and the maximum community size max_c. 
In our tests, we use the default parameter configuration 
where N = 1000, (k) = 15, maxjc = 50, min_c = 20, 
and max_c = 50. To test the influence from the dis- 
tribution of node degree and community size, we adopt 
four parameter configurations for 7 and /3, respectively 
being ( 7 ,/3) = (2,1), ( 7 ,/3) = (2,2), ( 7 ,/3) = (3,1) and 
(7, /3) = (3, 2) . By tuning the mixing ratio fi, we test the 



effectiveness of our method on the networks with differ- 
ent fuzziness of communities. The larger the parameter 
/i, the fuzzier the community structure of the generated 
network. In addition, we adopt the normalized mutual 
information (NMI) 34] to compare the partition found 
by community detection methods with the answer parti- 
tion. The larger the NMI, the more effective the tested 
method. 

Note that each benchmark network has only one intrin- 
sic topological scale according to the construction rules. 
Thus we only consider the largest eigengap in the spec- 
trum of the covariance matrix and the correlation matrix. 
The communities are identified using the top p eigenvec- 
tors corresponding to the eigenvalues lying the greater 
side of the largest eigengap. Specifically, the top p eigen- 
vectors are projected into the node vectors according to 
Eq. ([8]) , and then the communities are identified through 
clustering these node vectors using the fc-means cluster- 
ing method. Note that the community number is p + 1. 

The first test focuses on whether the intrinsic scale 
can be correctly uncovered. We investigate this problem 
through judging whether the number of intrinsic com- 
munities can be correctly identified. Fig. [5] shows the 
comparison between the correlation matrix and the co- 
variance matrix (identically the modularity matrix) on 
networks with four different parameter configurations for 
7 and p. When the community structure is evident, i.e., 
the mixing ratio /x is smaller, both the covariance ma- 
trix and the correlation matrix are effective at identifying 
the correct number of communities and thus the intrin- 
sic scale of the network. However, when the community 
structure becomes fuzzier with the increase of the mixing 
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FIG. 6. (Color online) Comparison between the correlation 
matrix and the covariance matrix in terms of their effective- 
ness at identifying the community structure on benchmark 
networks with different parameter configurations. Each point 
corresponds to an average over 100 network realizations. 



ratio [i, the performance of the covariance matrix dete- 
riorates while the correlation matrix still achieves rather 
good results. Even when the mixing ratio [i is larger than 
0.5, the border beyond which communities are no longer 
defined in the strong sense pi| , the number of communi- 
ties can still be accurately identified by investigating the 
spectrum of the correlation matrix. 

The second test turns to whether the intrinsic commu- 
nity structure can be identified. As demonstrated by the 
first test, the correlation matrix outperforms the covari- 
ance matrix at finding the correct number of communi- 
ties. However, as to the second test, we assume that the 
community number has been given a priori and then we 
compare the effectiveness of the eigenvectors of these two 
matrices in terms of the NMI when comparing the answer 
partitions with the ones obtained by the methods based 
on these two matrices. As shown in Fig. [6l these two 
matrices both exhibit very good performance at identi- 
fying the intrinsic community structure. By comparison, 
the correlation matrix outperforms the covariance ma- 
trix for all used parameter configurations. This indicates 
that the eigenvectors of the correlation matrix charac- 
terize the spread characteristics of network nodes better 
than that of the covariance matrix, especially when the 
community structure is fuzzier. 



B. Tests on real world networks 

In the previous subsection, we have demonstrated that 
the correlation matrix is superior to the covariance ma- 
trix at uncovering the intrinsic topological scale of the ar- 
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FIG. 7. (Color online) Tests on real world networks. The 
vertical axis represents the length of the eigengap and the 
horizontal axis represents the the corresponding community 
number indicated by the eigengap. The shaded-circles mark 
the largest eigengap. Note that, in order to identify the com- 
munity structure, only the eigengaps between positive eigen- 
values are taken into account. The vertical dashed lines are 
the place where the zero- valued eigenvalues occur. 



tificial benchmark networks. Now we apply the method 
based on the correlation matrix on real world networks. 
These networks are widely used to evaluate commu- 
nity detection methods. These networks include the 
Zachary's karate club network [29(, the journal index net- 
work constructed in [7| , the social network of dolphin by 
Lusseau et al 35], the college football network of the 
United States [l(, the match network of the NBA teams, 
the network of political books [3g], the network of jazz 
musician 37], the coauthor network of network scientist 
presented in [8| , and the email network of URV [38| . For 



convenience, these networks are respectively abbreviated 
to Zachary, journal, dolphin, football, nba, polbook, jazz, 
netsci and email. The test results on these networks are 
shown in Fig. [7] 

As shown in Fig.0 the eigengaps of the correlation ma- 
trix are very effective at the identification of the commu- 
nity number and accordingly the intrinsic scale of the net- 
work. For most of these networks with known community 
structure, the correlation matrix can accurately uncover 
such community structure including the networks jour- 
nal, dolphin, nba and polbook. For the other networks, 
the correlation matrix also gives very promising results 
for their community structure. Specifically, the detected 
communities reflect the structural and functional charac- 
teristics of each specific network, which can be verified 
through checking the nodes of each community. In ad- 
dition, for the last three networks, large eigengaps can 
be observed among the negative eigenvalues. This indi- 
cates that these networks contain anticommunity struc- 
ture. Actually, this phenomenon is also observed in many 
other real world networks, which are not included in this 
article. This will be discussed in our further work. 

In addition, we also test our method on some real world 



complex networks with larger size, including the protein 
interaction network and the word association network. 
However, no significant eigengap is observed in the spec- 
trum of the covariance matrix and the correlation matrix 
associated with these networks. This indicates that there 
is no scale which is significantly superior to other scales 
and thus no partition of network is more desired. Differ- 
ent from the eigenvalues of the covariance matrix, how- 
ever, besides the length of the eigengaps among eigenval- 
ues, the magnitude itself of the eigenvalues of the corre- 
lation matrix can provide us intuitive knowledge on the 
cohcsiveness of the communities at each specific scale. 
Generally speaking, an eigenvalue larger than 0.5 indi- 
cates the existence of a cohesive node group, i.e., a com- 
munity. Thus, according to the magnitude of eigenval- 
ues, people can choose the topological scale or partition 
of network which is appropriate for their application and 
practical demands. 



VI. CONCLUSIONS AND DISCUSSIONS 

In this article, we have studied the problem of de- 
tecting the multi-scale community structure in networks 
from the perspective of dimension reduction, i.e., from 
the straightforward description taking each node as one 
dimension to a coarse-grained description where each di- 
mension corresponds to a community. The main con- 
tributions of this article are as follows. Firstly, the co- 
variance matrix of network is defined to characterize the 
relationship between original node dimensions through 
the translation transformation. The covariance matrix 
is shown to be the unbiased version of the traditional 
modularity matrix, which underlies the benefit function 
of network partition, namely modularity. Secondly, us- 
ing the rotation transformation, we recognize several re- 
duced dimensions which are significant at reflecting the 
community structure of network. Taking these reduced 
dimensions as new axis vectors, the multi-scale commu- 
nity structure is then identified through investigating 
the spectrum of the covariance matrix and projecting 
the nodes of network into low-dimensional node vectors. 
Thirdly, to deal with the heterogeneous distributions of 
node degree and community size, the correlation matrix 
is proposed through introducing the rescaling transfor- 
mation into the covariance matrix. 

Extensive tests on real world and artificial networks 



demonstrate that the correlation matrix significantly out- 
performs the covariance matrix or the modularity matrix 
as regards revealing the multi-scale community structure 
of network. Note that the computational efforts of the 
correlation matrix are essentially identical to the modu- 
larity matrix. Thus we suggest to use the correlation ma- 
trix and accordingly the benefit function F to detect the 
multi-scale community structure of network, especially 
when the network has heavily heterogeneous distribution 
of node degree and community size. We also hope that 
the readers feel encouraged to apply the correlation ma- 
trix to other real world networks and we look forward to 
seeing the more efficient heuristic methods to optimize 
the benefit function F. 

In addition, we also notice that no significant scale is 
observed in several real world complex networks. In this 
case, the covariance matrix or the modularity matrix fails 
to provide obvious evident about the community struc- 
ture. However, since the magnitude itself of the eigen- 
values of the correlation matrix has indicative meaning 
about the cohesiveness of communities, the correlation 
matrix can still be used to reveal the community struc- 
ture. 

Finally, as to traditional spectral clustering methods, 
the normalized Laplacian matrix is also been suggested 
to replace the standard Laplacian matrix. The basic idea 
of the normalized Laplacian matrix is consistent to the 
rescaling transformation introduced in this article. Com- 
pared to the heuristic normalization transformation on 
the standard Laplacian, the rescaling transformation in 
this article has solid theoretical foundation from the per- 
spective of dimension reduction, for example, the princi- 
pal component analysis. Thus, the work in this article 
might provide insights to understand the spectral clus- 
tering methods and might have important implications 
to the clustering problem dealing with datasets with het- 
erogeneous cluster size. 
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